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Abstract 

This paper presents a sharp approximation of the density of long runs 
of a random walk conditioned on its end value or by an average of a func- 
tions of its summands as their number tends to infinity. The conditioning 
event is of moderate or large deviation type. The result extends the Gibbs 
conditional principle in the sense that it provides a description of the dis- 
tribution of the random walk on long subsequences. An algorithm for 
the simulation of such long runs is presented, together with an algorithm 
determining their maximal length for which the approximation is valid up 
to a prescribed accuracy. 



1 Introduction and notation 

1.1 Context and scope 

This paper explores the asymptotic distribution of a random walk conditioned on 
its final value as the number of summands increases. Denote X™ := (Xi, .., X n ) 
a set of n independent copies of a real random variable X with density p on 
R and S" := Xi + ...+X„. We consider approximations of the density of the 

vector Xf = (Xi, .., X fe ) on R k when S™ — n (a n \/Var~X. + £xj and a n is cither 
fixed different from or tends slowly to and k := k n is an integer sequence 
such that 

< lim sup k/n < 1 (1) 

n— >oo 

together with 

lim n — k = oo. (2) 

n— 7-oo 

Therefore we may consider the asymptopic behavior of the density of the tra- 
jectory of the random walk on long runs. For sake of applications we also 
address the case when S" is substituted by U" := / (Xi) + ... + / (X„) for 
some real valued measurable function /, and when the conditioning event writes 
(U? - n (a nV /yar/(X) + Ef (X)) ) . 



The interest in this question stems from various sources. When k is fixed 
(typically k = 1) this is a version of the Gibbs Conditional Principle which 
has been studied extensively for fixed a n , therefore under a large deviation con- 
dition. Diaconis and Freedman [5] have considered this issue also in the case 
k/n — !• 9 for < 6 < 1, in connection with de Finetti's Theorem for exchange- 
able finite sequences. Their interest was related to the approximation of the 
density of by the product density of the summands Xj's, therefore on the 
permanence of the independence of the X^'s under conditioning. Their result 
is in the spirit of van Camperhout and Cover [15] and to be paralleled with 
Csiszar's [1] asymptotic conditional independence result, when the condition- 
ing event is ^S" > n (^a n VVarX + E~Kj^j with a n fixed and positive. In the 

same vein and under the same large deviation condition Dembo and Zeitouni |3] 
considered similar problems. This question is also of importance in Statistical 
Physics. Numerous papers pertaining to structural properties of polymers deal 
with this issue, and we refer to [5] and [7] for a description of those problems and 
related results. In the moderate deviation case Ermakov 10 also considered a 
similar problem when k = 1. Although out of the scope of the present paper 
the result which is presented here is a cornerstone in the development of fast 
Importance Sampling procedures for rare event simulation; see a first attempt 
in this direction in [3J. In Statistics, M— estimators have the same weak be- 
havior as the empirical mean of their influence functions on the sampling points 
in the moderate deviation zone. Simulating samples under a given value of the 
M— estimator leads to improved test procedures under small p— values. 

We exhibit the change in the dependence structure of the X^'s under the 
conditioning as k/n — > 1 and provide an explicit and constructive solution to the 
approximation scheme. The approximating density is obtained as an adaptive 
change in the classical tilting argument combined with an adaptive change in 
the variance. Also when k = o(n) our result improves on existing ones since it 
provides a sharp approximation of the conditional density. The present result is 
optimal in the sense that it coincides with the exact conditional density in the 
gaussian case. 

The crucial aspect of our result is the following. The approximation of the 
density of X^" is not performed on the sequence of entire spaces M. k but merely 
on a sequence of subsets of M. k which bear the trajectories of the conditioned 
random walk with probability going to 1 as n tends to infinity; therefore the 
approximation is performed on typical paths. The reason which led us to consider 
approximation in this peculiar sense is twofold. First the approximation on 
typical paths is what is in fact needed for the applications of the present results 
in the field of simulation and of rare event analysis; second it avoids a number of 
technical conditions which are necessary in order to get an approximation on all 
M. k and which are indeed central in the above mentioned works; those conditions 
pertain to the regularity of the characteristic function of the underlying density 
p in order to get a good approximation in remote regions of K fc . Since the 
approximation is handled on paths generated under the conditional density of 
the Xi's under the conditioning, much is known on the region of M. k which is 
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reached with large probability by the conditioned random walk, through the 
analysis of the large values of the Xj's. 

For sake of numerical applications we provide explicit algorithms for the 
generation of such random walks together with a number of comments for the 
practical implementation. Also an explicit rule for the maximal value of k 
compatible with a given accuracy of the approximating scheme is presented 
and numerical simulation supports this rule; an algorithm for its calculation is 
presented. 



1.2 Notation and hypotheses 

In the context of the point conditioning 



E n := fs? = n (a n WarX + EXj 



the hypotheses are as below. The case when S" is substituted by U™ is post- 
poned to Section 3, together with the relevant hypotheses and notation. 

We assume that X satisfies the Cramer condition, i.e. X has a finite moment 
generating function <I>(i) := EexptX. in a non void neighborhood of 0; denote 

m(t) :=^log$(f) 



and 



At) := j t m{t). 



The values of m(t) and s 2 are the expectation and the variance of the tilted 
density 

n a (x) := ^Pix) (3) 

where t is the only solution of the equation m(t) = a when a belongs to the 
support of X, see Barnfoff-Nielsen [2] for details. Denote II a the probability 
measure with density ir a . 

We also assume that the characteristic function of X is in L r for some r > 1 
which is necessary for the Edgeworth expansions to be performed. 

The probability measure of the random vector X™ on M. n conditioned upon 
E n is denoted Cp n . We also denote *P„ the corresponding distribution of X^ 
conditioned upon E n \ the vector X^ then has a density with respect to the 
Lebesgue measure on M. k for 1 < k < n , which will be denoted p„, which might 
seem ambiguous but recalls that the conditioned distribution pertains to the 
value of S™, from which the density of Xj is obtained. For a generic r.v. Z with 
density p, we denote p (Z = z) the value of p at point z. 



This paper is organized as follows. Section 2 presents the approximation 
scheme for the conditional density of Xj under the point conditioning sequence 
E n . In section 3, it is extended to the case when the conditioning family of 
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events writes (U™ = n (a ny /Varf (X) + Ef (X)J J . The value of k for which 
this approximation is fair is discussed; an algorithm for the implementation of 
this rule is proposed. Section 4 presents an algorithm for the simulation of 
random variables under the approximating scheme. We have kept the main 
steps of the proofs in the core of the paper; some of the technicalities is left to 
the Appendix. 



2 Random walks conditioned on their sum 

We introduce a positive sequence e„ which satisfies 



lim e n y/n — k = oo (El) 

n— >oo 

lim e„ (logn) 2 = 0. (E2) 

It will be shown that e„ (logn) 2 is the rate of accuracy of the approximating 
scheme. 

We denote a the generic term of the bounded sequence (a„) n>1 , which we 
assume positive, without loss of generality . The event £ n is of moderate or large 
deviation type, since we assume that 

lim g = 00 ■ (A) 

e„ (logn) 

The case when a does not depend on n satisfies (A) for any sequence e„ 
under (El, 2). Conditions (A) and (El, 2) jointly imply that a cannot satisfy 
^fna — > c for some fixed c; the Central Limit zone is not covered by our result. 
In order that there exists a sequence e„ such that the approximation of p n holds 
with rate e„ (logn) 2 — » 0, a sufficient condition on a n is 

lim n ^ 2 = oo (4) 



(logn) 2 



which covers both the moderate and the large deviation cases. 

Under these assumptions k can be fixed or can grow together with n with the 
restriction that n — k should tend to infinity; when a is fixed this rate is governed 
through (El) (or reciprocally given k ,e„ is governed by k) independently on a. 
In the moderate deviation case for a given sequence a close to 0, e„ has rapid 
decrease, which in turn forces n — k to grow rapidly. 

In this section we assume that X has expectation and variance 1. For clear- 
ness the dependence in n of all quantities involved in the coming development 
is omitted in the notation. 
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2.1 Approximation of the density of the runs 

Let a = a n denote the current term of a sequence satisfying (A) . Define a density 
9a(Vi) on K fe as follows. Set 

go(yi\yo) ■= 7r a (yi) 

with y arbitrary, and for 1 < i < k — 1 define gi{yi+\ \ y\) recursively. 
Set U the unique solution of the equation 

mi := m(U) = — ^— I a ) (5) 

n — i \ n ) 

where s\ := yi + ... + y%. The tilted adaptive family of densities ir mi is the basic 
ingredient of the derivation of approximating scheme. Let 

d 2 

si := ^ (log E w m t exptX) (0) 

and 

d? 

Ii) := — 2 (log^m, expiX) (0) , j = 3,4 
which are the second , third and fourth centered moments of ir mi . Let 

9i(Vi+i\yi) = Cip(y i+1 )n(a + a/3,a,y i+1 ) (6) 
where n (/x, r, x) is the normal density with mean \x and variance r at X. Here 

o = (» - j - !) ( 7 > 

and Ci is a normalizing constant. 
Define 

fe-i 

5a(j/i) := n^(fi+i|j/i)- ( 9 ) 

i=0 

We then have 

Theorem 1 Assume that (El, 2) holds together with (A). Let Y™ be a sample 
with distribution Then 

p n (yfc) :=p(Xf = n fe | S? = na) = 5a(^)(l + o<p„ (e„ (logn) 2 )). (10) 

Proof. The proof uses Bayes formula to write p( = | S" = na) as a 
product of conditional densities of individual terms of the trajectory evalu- 
ated at Y±. Each term of this product is approximated through an Edgeworth 
expansion which together with the properties of Yj° under <}3 n concludes the 
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proof. This proof is rather long and we have differed its technical steps to the 
Appendix. 

Denote = 0, £} := Y 1 and £j := 1 + Y,. It holds 
p(Xf = J? I SJ = no) =p(X x = F x | SJ = no) 



(11) 



fc-i 



i+l 



V I Y* V* CI™ 

•*i+l| -A-l — Y l > 3 1 



i=l 



fc-1 



j+i 



^+ii 



Si 



i=Q 



using independence of the r.v's X^s. 

We make use of the following property which states the invariance of condi- 
tional densities under the tilting: For 1 < i < j < n, for all a in the range of X, 
for all u and s 



p si 



Define U through 



m(U) 



(12) 



SI 



a function of the past r.v's Yl and set m, := m(ij) and sf :— s 2 (ti). By (12) 

p(X s:+1 =r l+ i|s s n +1 =no-Si) 
= n mi (X,+i = S? +1 = no - El) 

^ (S? +2 = no - £j +1 ) 



7r m * (X i+ i = li+i) 



S? +1 = na - El) 



where we used the independence of the Xj's under 7r mi . A precise evaluation of 
the dominating terms in this lattest expression is needed in order to handle the 
product ( 11 ). 

Under the sequence of densities tt" h the i.i.d. r.v's Xj+i, X n define a 
triangular array which satisfies a local central limit theorem, and an Edge- 
worth expansion. Under ir mi , Xj_|_i has expectation rrii and variance sf. Center 
and normalize both the numerator and denominator in the fraction which ap- 
pears in the last display. Denote 7r n _j_i the density of the normalized sum 
(S" +2 — (n — i — l)?7ij) / (sjVn — i — 1) when the summands are i.i.d. with 
common density 7r mi . Accordingly 7r„_i is the density of (S™ +1 — (n — i)mj) / [si^/rT 
under i.i.d. 7r" li sampling. Hence, evaluating both 7r„_i_i and its normal ap- 
proximation at point li+i, 

p(X m =y i+1 |S? +1 =na-£l) (13) 

7T n _i_i ((mi - Yj+l) /Siy/n- i - l) 



\Jn — i 
y/n — i 



zir" 1 ' (X i+ i = YJ+i) 



i(0) 



(X, 



+1 



i+l) 



Ni 

Da 
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The sequence of densities 7r„_i_i converges pointwise to the standard normal 
density under (El) which implies that n — k tends to infinity, and an Edgeworth 
expansion to the order 5 is performed for the numerator and the denominator. 
The main arguments used in order to obtain the order of magnitude of the 
envolved quantities are (i) a maximal inequality which controls the magnitude 



of mi for all i between and k — 1 (Lemma 13), (ii) the order of the maximum 



of the Y[s (Lemma 14). As proved in the Appendix, under (A) 



Ni = </>( -Yi 



1 



.A.B + Vn 
where <f> is the standard normal density and, 



.4 



1 



aY,. 



i+i 



sf(n-i-l) 2sf(n-i-l) 




and 



"3 



B 



!)(«- 



2sf (n-j-1) 
15(a4) 2 



8sf(n-i-l) 72s?(n-i-l) 



Y i+ i) 

, Q-y„((log") 2 ) 



(14) 



(15) 



(16) 



-I) 2 



The 0<p n ( j^ j 1 1)3/2 ^ t erm i n (14) is uniform upon (m, — ii+i) /si-s/r 
Turn back to ( 13 ) and do the same Edgeworth expansion in the demo 
which writes 

/4-sf 15(a4) 2 \ „ / 1 



A = 0(0) 1 



8sf(n-i) 72sf(n-i) 



0» 



(n — i) 



3/2 



(17) 



The terms in gi ( l^+i | ) follow from an expansion in the ratio of the two expres- 
sions ( 14) and ( 17) above. The gaussian contribution is explicit in (fl4| ) while the 

term exp 2s i (n-i-i) ^ s ^ ne dominant term in B. Turning to (13[) and com- 



paring with (10) it appears that the normalizing factor Cj in gi{Yi + \ \ Y£) com- 

— =-= exp ( „ 2 7 aAl3 I , where the term ^(tA comes 



pensates the term 



from 7r" li (Xj+i = l^+i) ■ Fur ther the product of the remaining terms in the 



above approximations in (14) and (17) turn to build the 1 + o<p n |^e n (logn) 
approximation rate, as claimed. Details arc differed to the Appendix. This yields 



fe-i 



l + o<p„ (e n (logn) 2 )) n^(^+i|^ 



i=0 



which closes the proof of the Theorem. ■ 

Remark 2 When the 's are i.i.d. with a standard normal density, then the 
result in the above approximation Theorem holds with k = n — 1 stating that 

rn—\ _n— II on "\ „ t „n— 1\ /• „ji ri — 1 • mm 



p(xr 



S? = na) = 5a (a;'^ 1 ) /or a^Z cc™~ 



r/izs extends to 
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the case when they have an infinitely divisible distribution. However formula 



10) holds true without the error term only in the gaussian case. Similar exact 



formulas can be obtained for infinitely divisible distributions using (11) making 
no use of tilting. Such formula is used to produce Tables 1 and 2 in order to 
assess the validity of the selection rule for k in the exponential case. 

Remark 3 The density in is a slight modification of . The modification 
from ir" li to gi is a small shift in the location parameter depending both on 
a n and on the skewness of p, and a change in the variance : large values of 
X.; + i have smaller weigth for large i, so that the distribution of Xj+i tends to 
concentrate around rrii as i approaches k. 

Remark 4 In the previous Theorem, as in Lemma \14\ we use an Edgeworth 
expansion for the density of the normalized sum of the n—th row of some trian- 
gular array of row-wise independent r. v 's with common density. Consider the 
i.i.d. r.v's Xj., ...,X„ with common density ir a (x) where a may depend on n but 
remains bounded. The Edgeworth expansion pertaining to the normalized den- 
sity of S" under ir a can be derived following closely the proof given for example 
in jllf . pp 532 and followings substituting the cumulants of p by those of ir a . 
Denote (f a {z) the characteristic function of ir a (x). Clearly for any S > there 
exists q a .s < 1 such that \ip a (z)\ < q a _s and since a is bounded, swp n q a .$ < 1. 
Therefore the inequality (2.5) in 111]/ p533 holds. With tp n defined as in 11 If . 
(2.6) holds with if replaced by (p a and a by s(t a )', (2.9) holds, which completes 
the proof of the Edgeworth expansion in the simple case. The proof goes in the 
same way for higher order expansions. 

2.2 Sampling under the approximation 

Applications of Theorem [l] in Importance Sampling procedures and in Statistics 
require a reverse result. So assume that Y± is a random vector generated under 
G a with density g a . Can we state that g a (Yi) is a good approximation for 
Pn (Xi )? This holds true. We state a simple Lemma in this direction. 

Let £R„ and & n denote two p.m's on ]R n with respective densities t n and s n . 

Lemma 5 Suppose that for some sequence e n which tends to as n tends to 
infinity 

t n (Y 1 ")=5 n (y 1 ")(i + 05Hn ( £ri )) (is) 

as n tends to oo. Then 

s n = t„ (17) (1 + o en (e n )) . (19) 

Proof. Denote 

A n , En := Wl : (1 - e n )s n (y?) < r„ (y?) < s n (y?) (1 + e n )} . 
It holds for all positive S 

lim JH„(A n , fe J = 1. 
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Write 

(A nt5E J = J l AniSen (tf) ^M Sn (yn )dy n 

Since 

£Rn {A n ,seJ < (1 + 5e n )G n (A„ tSsn ) 

it follows that 

lim & n (An 5e n ) = 1, 
n— >oo 

which proves the claim. ■ 

As a direct by-product of Theorem [T] and Lemma [5] we obtain 

Theorem 6 Assume (A), (El, 2). Then when Yj" is generated under the dis- 
tribution G a it holds 

Pn {Yf) = g a {Y?)(l + o Ga (e n (logn) 2 )) 

with p n defined in Ut 

3 Random walks conditioned by the mean of a 
function of their summands 

This section extends the above results to the case when the conditioning event 
writes 

U? := / (Xi) + ... + / (X„) = n (era + fi) . (20) 

The function / is real valued, Ef (X) = [i and Varf (X) = a 2 . The character- 
istic function of the random variable / (X) is assumed to belong to L r for some 
r > 1. As previously a is assumed positive. Let px denote the density of the 
r.v. X. 
Assume 

4> f (t) := Eexptf (X) < oo 

for t in a non void neighborhood of 0. Define the functions mf(t), s 2 (t) and 
Hf,3(t) as the first, second and third derivatives of \og<f>f(t). 
Denote 

exptf(x) 

with rrif(t) = a and a belongs to the support of Pf, the distribution of / (X) , 
with density pf. Conditions on <f>f(t) which ensure existence and uniqueness of 
t are referred to as steepness properties, and are exposed in [5] 
Assume that (A) holds and the sequence e„ satisfies (El, 2). 
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3.1 Approximation of the density of the runs 

Define a density h^a+^ivi) with c.d.f. H aa+fl on M. k as follows. Set 

ho (j/l 1 2/o) := 7T/ a+M (2/i) 

with 7/0 arbitrary and for 1 < i < k — 1 define hi(yi+i\ y\) recursively. 
Set U the unique solution of the equation 

. . n ( u\\ . . 

mi := mflti) = : aa + u (21) 

n — 1 \ n J 

where u\ := f (y{) + ... + / (y 4 ) . 
Define 

hi(Vi+i\y\) = CiPx(y i+ t)n(aP + (aa + fi) ,a, f(y i+ i)) (22) 
where Ci is a normalizing constant. Here 



a = s 2 f (U)(n-i-l) (23) 



P^ + vvrfr^- iv (24) 

Set 

fe-i 

K a+ »(y$) ■■= Hhiivi+ilvi). (25) 

Denote the distribution of X™ conditioned upon (U™ = n (aa + fi)) and 
pi its density when restricted on M fc ; therefore 

pf n (Xf = F*) := p(X{ = y*| U? - n (aa + (26) 

Theorem 7 Assume (04 J and (El, 2) . Then (i) 

pi (X{ = i?) = + Oqj/(e„ (logn) 2 )) 

and (mJ 

pf n (X* = y x fe ) = /w„(l?)(l + ^ o+M (e„ (logn) 2 )). 

Proof. We only sketch the initial step of the proof of (i) , which rapidly follows 
the same track as that in Theorem [l] Denote \j{ := f(Yi) + ... + f(Yj). 
As in the proof of Theorem [T] evaluate 

P (Xi+i - Y i+1 \ U? +1 = n (aa + M ) - 1^) 

p(ur +2 = n(a a+M )-c/i +i ; 



= p(x i+ i = y+i) 



p (Uf +1 = n (aa + fx) - U{) 
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Use the tilting invariance under 7rJ li leading to 
p (X i+1 = Y i+1 \ U? +1 = n (aa + M ) - I/?) 



e «,/(r. + Trp (U? +2 = n (aa + M ) - U{ 



and proceed through the Edgeworth expansions in the above expression, follow- 
ing verbatim the proof of Theorem [I] We omit details. The proof of (ii) follows 
from Lemma [5] ■ 

3.2 How far is the approximation valid? 

This section provides a rule leading to an effective choice of the crucial parameter 
k in order to achieve a given accuracy bound for the relative error. The generic 
r.v. X has density px and / (X) has mean \i and variance a 2 . The density p£ 



is defined in ( 26 ) . The accuracy of the approximation is measured through 



ERE{k) := EHn ,„ l Dk (l?) P J± ^ Ka+ » ^ 



pi (Y*) 
and 

VRE(k) := Var Ka+ „ l Dk (Y>) P " J ^ (27) 

Pn (*?) 

respectively the expectation and the variance of the relative error of the approxi- 
mating scheme when evaluated on Dk , the subset of R h where \ h aa+tJ ,(Yi )/p( l (Y^ — 1 < 
6 n with e„ (logn) 2 /5 n —*0 and 5 n — > 0; therefore H aa+I1 (D k ) — > l.The r.v's Yf 
are sampled under h aa+ ^. Note that the density p£ is usually unknown. The 
argument is somehow heuristic and unformal; nevertheless the rule is simple to 
implement and provides good results. We assume that the set Dk can be sub- 
stituted by K fc in the above formulas, therefore assuming that the relative error 
has bounded variance, which would require quite a lot of work to be proved 
under appropriate conditions, but which seems to hold, at least in all cases 
considered by the authors. We keep the above notation omitting therefore any 
reference to Dk ■ 

Consider a two-sigma confidence bound for the relative accuracy for a given 
k, defining 



CJ(fc) := ERE(k) - 2y / VRE(k), ERE(k) + 2 y / VRE(k) 



Let 5 denote an acceptance level for the relative accuracy. Accept k until S 
belongs to CI(k). For such k the relative accuracy is certified up to the level 
5% roughly. 
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The calculation of VRE(k) and ERE(k) should be done as follows. 
Write 



VRE{kf = E px 



E, 



h 3 

"'aa+fj. 



y n V (n fc ). 



~A-B 2 . 

where n a is defined in By Bayes formula 

■iw-nw y!"?"! ' 

(n-*)Pu?/n (o-o + W 

and na. 



(28) 

(29) 
(30) 

(31) 



The following Lemma holds; see 



Lemma 8 Let Xi, ...,X„ be i.i.d. random variables with common density p on 
K and satisfying the Cramer conditions with m.g.f. <f>. Then with (log0) (t) = u, 



. . y/n(j) n (t)exp—ntu^ 

^ (m)= (1+o(1)) 



Ps 



m t/ie range of the large or moderate deviations, i.e. when \u\\/n — > oo and \u\ 
is bounded from above. 



Introduce 



and 



D 



N := 



Px(a) 



Px (mjfe) . 



with mfc defined in (21 1. Define i by my (i) = era + [i. By (31 1 and Lemma [8] it 
holds 

The approximation of A is obtained through Monte Carlo simulation. Define 



A(Yf) := 



2 „2 



sj(t k ) 



(32) 



and simulate L i.i.d. samples Y^{1) , each one made of i.i.d. replications under 
Px; set 



A:=if>( y i fe (0) 



/=i 
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We use the same approximation for B. Define 



2 



B(Y k ) - ^ (^ V ( D \ s f^) 



and 



5: =iE B ( r i fe «) 



with the same Yi(l)'s as above. 
Set 



VJiE(fc) := A - [Bj (34) 

which is a fair approximation of VRE(k). 
The curve fc — > ERE(k) is a proxy for 

Pl { (yfc) - (y*) 



ERE(k) := £? haB+( , 
and is obtained through 

KRE{k) :=l-B. 
A proxy of CI{k) can now be defined through 



CJ(fc) 



ERE(k) - 2JVRE(k),ERE(k) + 2JVRE(k) 



(35) 



We now check the validity of the just above approximation, comparing CI{k) 
with CI(k) on a toy case. 

Consider /(x) = x. The case when p is a centered exponential distribution 
with variance 1 allows for an explicity evaluation of CI{k) making no use of 
Lemma [8j The conditional density p n is calculated analytically, the density g a 
is obtained through hence providing a benchmark for our proposal. The 
terms A and B are obtained by Monte Carlo simulation following the algorithm 
presented hereunder. Tables 1,2 and 3,4 show the increase in S w.r.t. k in the 
moderate deviation range, with a such that P (S™ > no) ~ 1CP 2 . In Table 5,6 
and 7,8, a is such that P (S™ > na) ~ 10~ 8 corresponding to a large deviation 
case. We have considered two cases, when n = 100 and when n = 1000. These 
tables show that the approximation scheme is quite accurate, since the relative 
error is fairly small even when approximating event is in spaces with very high 
dimension. Also they show that ERE et CI provide good tools for the assessing 
the value of k. 
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Figure 1: CI(k) for n=100 
and P(S? > na) ~ 1CT 2 . 



Figure 2: CJ(fc) for n=100 
and P(S? > na) ~ 1CT 2 . 



Figure 3: CI(k) for n=1000 
and P(S? > na) ~ 1CT 2 . 



Figure 5: C7(fc) for n=100 
and P (S? > na) ~ 1CT 8 . 



Figure 4: C7(fc) for n=1000 
and P (S? > na) ~ 1CT 2 . 



Figure 6: CI{k) for n=100 
and P (S? > na) ~ 1CT 8 . 
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Figure 7: CI{k) for n^lOOO 
and P(S? > no) ~ 1CT 8 . 



Figure 8: CI{k) for n=1000 
and P(S? > no) ~ 1CT 8 . 



We present a series of algorithms which produces the curve k — > RE(k) in 
the case when / (X) has expectation [i and variance a 2 . 



Algorithm 1 Evaluates the function h aa+ ^ 




INPUT 

vector Xi, integer n, density px, level a 

OUTPUT 

INITIALIZATION 

to '■= m* (era + fi) 
h (xi\ x?) := 7rJ Q+M (xi) 
T,\ := xi 
PROCEDURE 

For i from 1 to k — 1 

m, :=(21| 

U := ml (m,-) 

Calculate Cj in (22) through MonteCarlo 

/ii(x i+ i|xl) :=(22| 
endFor 
Compute 

Return /i CTO+M (xf) 





Remark 9 Solving tj :— mj (rrii) might be difficult, even through a Newton 
Raphson technique and time consuming in simple cases. It may happen that the 
reciprocal function ofuif is at hand, as is assumed in Dupuis and Wang Jfjj/, but 
even in such current situation as the Weibull distribution and f(x) — x, such is 
not the case. An alternative computation is presented in Algorithm 1 following 
an expansion in m/(ij), which is a good update since s/(tj_i) is stable around 
varf (X) in the case when a tends to Ef (X) or to s At) when a tends to or 
for fixed a, as follows from a variant of Lemma \13\ 
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Algorithm 1' 



Similar to Algorithm 1 with ti := m^ L {mi) substituted by 

+ j- i m/(t i _i)-a; i 



Algorithm 2 : Calculates kg 



INPUT 

density px, level a, efficiency 5, integer n, integer L 

OUTPUT 

kg 

INITIALIZE 

k = 1 
PROCEDURE 
Do 

For I from 1 to L 

Simulate Y^(l) i.i.d. with density px 



endFor 



A(Yf(l)) :=(32) using Algorithm 1 
B (y x fc (0) :=l 33 > using Algorithm 1 



Calculate CJ(fc) :=( 35 > 
k := k + 1 
While 5 ^ aT(fe) 
endDo 
Return kg:= k 



4 Simulation of typical paths of a random walk 
under a point conditioning 

By Theorem [7] (ii), h aa+ ^ and the density of under (U™ = n (era + fj,)) get 
closer and closer on a family of subsets of R fe which bear the typical paths of the 
random walk under the conditioning (U™ = n (era + //)) with probability going 
to 1 as n increases. By Lemma [H] large sets under are also large sets under 



H aa +^- It follows that longs runs of typical paths under defined in (26 1 can 
be simulated as typical paths under H aa+I1 at least for large n. 
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Algorithm 3 : Simulates a sample Y± with density h aa+ n 



integer n, density px, level a , accuracy 5 



INPUT 
OUTPUT 

Vector Yi 
INITIALIZATION 

Set k:= ks with Algorithm 2 
to := mj (era + /i) 
PROCEDURE 

Simulate Y\ with density izZ 



Si := Y 1 

For i from 1 to k — 1 



7 



(21) 



(23 



K) 



Simulate Yi+i with density yf) 

»+i - 
l 

endFor 

Return 



£1 +1 := EJ + y i+ i 



Algorithm 3' 



Similar to Algorithm 3 with ti :— m x 1 (mj) substituted by 



mf(ti—i^—Xi 
(n-i)s^(tj_i) 



Remark 10 TTie r.u. Yi can 6e obtained through Metropolis-Hastings algo- 
rithm; see also <1-1 which uses a truncated approximation. 

The following algorithm provides a simple acceptance/rejection simulation 
tool for Yi+i with density hi(yi+i \ y\); it does not require any estimation of the 
normalizing factor. Metropolis-Hastings may also be useful in complex cases. 

Denote 91 the c.d.f. of a normal variate with parameter (/z, a 2 ) ,and 9T _1 its 
inverse. 
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Algorithm 4 : Simulates Y with density proportional to p(x)n (u, <J 2 ,x) 




INPUT 

density p 

OUTPUT 

Y 

INITIALIZATION 

Select a density / on [0, 1] and a positive constant K such that 
p (OT^a;)) < Kf(x) for all a; in [0, 1] 
PROCEDURE 
Do 

Simulate X with density / 

Simulate U uniform on [0, 1] independent of X 

Z := KUf(X) 
While Z < p(0^- x (X)) 
endDo 

Return Y := ^(X) 





Tables 9,10,11 and 12 present a number of simulations of random walks con- 
ditioned on their sum with n — 1000 when f(x) — x. In the gaussian case, when 
the aproximating scheme is known to be optimal up to k = n— 1, the simulation 
is performed with k = 999 and two cases are considered: the moderate deviation 
case is when P(S™ > no) = 10 -2 (Table 9) and the large deviation pertains to 
P(S™ > na) = 10 -8 (Table 10). The centered exponential case with n = 1000 
and k = 900 is presented in Tables 11 and 12, under the same events. In order 
to check the accuracy of the approximation, Tables 13,14 (normal case, n=1000, 
k=999) and Tables 15,16 (centered exponential case, n=1000, k=900) present 
the histograms of the simulated X^s together with the tilted densities at point 
a which are known to be the limit density of Xi conditioned on £ n in the large 
deviation case, and to be equivalent to the same density in the moderate devi- 
ation case, as can be deduced from [10]. The tilted density in the gaussian case 
is the normal with mean a and variance 1; in the centered exponential case the 
tilted density is an exponential density on (— l,oo) with parameter 1/(1 + a). 
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Figure 9: Trajectories in the normal 
case for P(S? > na) = 1CT 2 
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Figure 11: Trajectories in the expo- 
nential case for P(S™ > na) = 10~ 2 
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Figure 10: Trajectories in the normal 
case for P(S? > na) = 1CT 8 
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Figure 12: Trajectories in the expo- 
nential case for P(S? > na) = 1CT 8 
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Figure 13: Distribution of X-s in the 
normal case for P(S™ > na) = 10~ 2 




Figure 15: Distribution of X^s in the 
exponential case for 
P(S? > na) = 10~ 2 



Figure 14: Distribution of X-s in the 
normal case for P(S™ > na) = 10~ 8 




Figure 16: Distribution of X-s in the 
exponential case for 
P(S? > na) = 10- 8 
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Consider now the case when f(x) = x 2 . Table 1 presents the case when X is 
JV(0, 1), n = 1000, k = 800, P (U? = n (a-y/2 + l)) - 10 -2 . We present the his- 
tograms of the X[s together with the graph of the corresponding titlted density; 
when X is N(0, 1) then X 2 is % 2 - It is well known that when a is fixed larger 
than 1 then the limit distribution of conditioned on (U™ = n (av2 + l)) 
tends to N (0, 1 + av2) which is the Kullback-Leibler projection of N(0, 1) on 
the set of all probability measures Q on K with J x 2 dQ{x) = ay/2 + 1. Now 



this distribution is precisely ho(yi\yo) defined hereabove. Also consider (22); 
expansion using the definitions ( 23 1 and ( |24[ ) prove that as n — > oo the domi- 
nating term in hi(yi + i \ y\) is precisely N (0, 1 + ay2) , and the terms including 
yf+i in the exponential stemming from n (a/3 + (era + fx) , a, f(yi+i)) are of or- 
der 0(1/ (n — i)); the terms depending on y\ are of smaller order. The fit which 
is observed in Table 17 is in concordance with the above statement in the LDP 
range (fixed a) , and with the MDP approximation following Ermakov; see [10] . 




Figure 17: Distribution of X.^s in the 
normal case for P(S" > na) = 10~ 2 
and for f(x) = x 2 



Remark 11 The statistics U™ may be substituted by any regular M- estimator 
when a = a n defines a moderate deviation event, namely when (A) holds together 
with a n — ¥ 0. In this case it is well known that the distribution of any regular M- 
estimator is similar to that of the mean of its influence function evaluated on the 
sample points; this allows to simulate samples under a given model conditionally 
on an observed value of a statistics, when observed in a rare area under the 
model. 



5 Conclusion 

We have obtained an extended version of Gibbs conditional principle in the 
simple case of real valued independent r.v's conditioned on the value of their 
mean or on an average of their images through some real valued function. The 
approximation of the density of long runs is shown to be quite accurate and 
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can be controlled through an explicit rule. Algorithms for the simulation of 
these runs are presented, together with numerical examples. Applications to 
Importance Sampling procedures for rare event simulation is a first applica- 
tion of this scheme; it mainly requires to consider conditioning events of the 
form (S™ > n (aa n + fi)) instead of (S™ = n (aa n + [/,)); first numerical results 
obtained in [3| show a net gain in the variance for IS estimators using an exten- 
sion of the present scheme. Extension to the multivariate setting is obtainable, 
requiring slight modifications. The case when the conditioning event is in the 
CLT zone deserves attention due to its interest to statistics. Simulation of X™ 
under a given pg in a model (pg, 9 <E 6) may lead to conditional test when a is 
substituted by the observed value of a given statistics. The present case when 
conditioning under a moderate deviation event is of interest for accurate asses- 
ment when the observed statistics has small p— value under a given hypothesis. 
Some other potential application to statistics is related to test procedures in 
presence of nuisance parameter, considering conditional tests under a sufficient 
statistics for the nuisance. 

A Three Lemmas pertaining to the partial sum 
under its final value 

We state three lemmas which describe some functions of the random vector X" 
conditioned on £ n . The r.v. X is assumed to have expectation and variance 
1. 

Lemma 12 It holds fip„ (Xi) = a,fip„ (X X X 2 ) = a 2 + (A) . fip n (X?) = 
s 2 (t) + a 2 + (i) where m(t) = a. 

Proof. Using 



normalizing both 7r|„ (na — x) and 7Tgn (na) and making use of a first order 



joint density *p„(Xi = x, X2 = y), using the same tilted distribution n a it 
readily follows that the last result holds. We used the fact that a n is a bounded 
sequence. ■ 

The following result states the behavior of the moments of ir nii . 

Lemma 13 Assume (A) and (El). Then maxi<Kit = a + otp^ (e n ) . Also 
maxi<i<fc sf , maxi<i<fc ^ and maxi<i<fc \i\ tend in probability to the vari- 
ance, skewness and kurtosis of p when a — a n — > and remain bounded when a 
is fixed positive. 



p S j (na - x)p Xl (x) 
p s ™ (na) 



tt|„ (na - x) tt&j (x) 





22 



Proof. Define 



V i+ i := m{ti) - a 



We state that 



— — — a. 



n max \V i+1 1 =o<p„ (e n ) , (36) 

0<2<fc— 1 



namely for all positive 5 

lim «p„ ( max > Se n ) = 

which we prove following Kolmogorov maximal inequality. Define 
Ai := ((\V +1 \ > Se n ) and {\Vj\ < Se n for all j < i + 1)) . 

from which 

max |Vf +1 | > Se n ) = II A t . 



i=0 



It holds 



Ey n V k 2 = f V 2 d% t + f V k 2 d^ n 

JuAi J(uAi) c 
2 



> 



(V 2 + 2{V k - Vi) Vi) d%, + / (V 2 +2(V k - Vi) V) d^ n 
uAi J(uA % y 

> I v 2 dy n 

JuA, 

fc-1 
3=0 

= 5 2 e 2 l < # n ( max \V i+ i\>Se r 

\Q<i<k—l 

The third line above follows from EVi (V k — Vi) = which is proved hereun- 
der. Hence 



where we used Lemma[l2j therefore p6[ ) holds under (El). By direct calculation, 
we can show that E<$ n (Vi(Vk — Vi)) = 0, which achieves the proof. ■ 

We also need the order of magnitude of max (Xi, X&) under Cp„ which is 
stated in the following result. 

Lemma 14 For all k between 1 and n, max(Xi, ...,Xfc) = 0<p„ (log/c) . 
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Proof. For all t it holds 

(max(Xi,...,X fc ) >t) < k<# n (X„ > t) 



, f°° a, v x ^(S? -1 = na„ - u) , 



na„) 



Let t be such that m(r) = a. Center and normalize both S™ and S™ _ with 
respect to the density 7r a in the last line above, denoting 7r£ the density of 
S™ := (S™ — na n ) /s^y/n when X has density TT a with mean a and variance 
(s( Q )) , we get 

rrr poo 

q3 n (max(Xi,...,X fc ) > t) < k ^== / ir a (X„ - u) 

y/n-l J t 

(ST 1 = (no - u - (n - l)a) / (*(«•) V^T)) 

. r -d,U. 

K (S? - 0) 

Under the sequence of densities 7r a the triangular array (Xi,...,X„) obeys a 
first order Edgeworth expansion 

(max (Xi, X fc ) > t) < k^= f ir a (X„ = u) 

n ((a - u) /s^yf^T) P (u, i, n) + o(l) 



n(0) + o(l) 



/,-f.sf / tt" (X„ = 



/oo 
7T° (X„ 

for some constant Cst independent of n and r and 

P (u, i, n) := 1 + P 3 ((a - «) /« (a) Vn 7 !) 

where Pz{x) = ^f^y ( x3 — 3a;) is the third Hcrmitc polynomial; /j,^ is the 

third centered moment of 7r a . We used uniformity upon u in the remaining 
term of the Edgeworth expansions. Making use of Chernoff Inequality to bound 

n a (x„ > t) , 

<P„ (max (Xi,..., X fc ) >t) <kCst^-p-e- xt 

for any A such that (f>(t + A) is finite. For t such that 

t/logk — > oo 

it holds 

<P„(max(X 1 ,...,X fe ) <t) -> 1, 
which proves the lemma. ■ 
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B Proof of the approximations resulting from 
Edgeworth expansions in Theorem 1 



We complete the calculation leading to ( 15 ) and ( 16 ) 



Set Z i+ i := (rrii - Y i+1 ) js^n-i- 1. 
It then holds 



TTn-i-l {Zi + x) = n(Z i+ i) 



O, 



1 + 7ra p 3(^+i) + ^4=1^(^+1) 
+ (— rij372 Pa(Zi+i) 

1 \ 



(37) 



(n-i-1) 



3/2 



We perform an expansion in n(Zi+i) up to the order 3, with a first order 
term n {—Y i+ x/ (s^n — i — l)) , namely 

n(Z i+ i) = n (si - i - l) ) (38) 
/ 1 , m| / y, 2 +1 A \ 

^ »J(n-t-l) ^ 2s?(n-i-l) ^ «?("-»-!) / 
y + 6sf(n-i-l) 3/2 n(-Y i+ i/(s iv / n-i-l)) / 

where Y* = — 7 J^= (-Y i+1 + 6m,) with |0| < 1. 

Lemmas [13] and [14] provide the orders of magnitude of the random terms in 
the above displays when sampling under 

Use those lemmas to obtain 



Yi+irrii Y l+ i 

~T7 ■ 77 = : 7 « + (£n 

(n — i — 1) tl — % — 1 



(39) 



and 



'"7 



sf (n — i — 1) n — i - 



-y (a + oqj„ (e„)) 



Also when (A) holds then the dominant terms in the bracket in ( 38 1 are precisely 
those in the two displays just above. This yields 



n(Z i+ i) = n 



-Yi 



1 



sf(n-i-l) 2s?(n— 
~T n-i-1 



We now need a precise evaluation of the terms in the Hermite polynomials 



in (37 1. This is achieved using Lemmas 13 and 14 which provide uniformity 



upon i between 1 and k = k n in all terms depending on the sample path Yf. 
The Hermite polynomials depend upon the moments of the underlying density 

7r mi . Since 7r™ i has expectation and variance 1 the terms corresponding to P\ 

(») 

and P2 vanish. Up to the order 4 the polynomials write = r , 3 rg H^x), 
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72( Si y 



;H 6 (x) 4 
6x 2 — 3 and H e (x) 



24( Sl ) 

x 6 - 15x 4 



Hi{x) with H 3 (x) := x 3 - 3x, H 4 (x) 



45x 2 - 15. 



Using Lemma 13 it appears that the terms in x J , j > 3 in P3 and P4 will play 
no role in the asymptotic behavior in (37) with respect to the constant term 
in P4 and the term in x from P3 . Indeed substituting x by Zi + \ and dividing 
by n — i — 1, the term in x 2 in P4 writes Oqj„ (logn) 2 /(n — i) 2 where we used 
Lemma |13| These terms are of smaller order than the term —3a; in P 3 which 



writes 

It holds 



y/n — i — 1 



Ma 



2sj (n- 



1) 



(mi - F l+ i) 



6( Sl ) 6 («-*-l) 2 



which yields 

P 3 (^+i) 
yn- 



M3 



1 



24 (n 



1 



For the term of order 4 it holds 

Pi{Z l+ i) = 1 

n—i—1 n — i 

which yields 



i+lj 



1 V 72s 



1 

(n - i - 1) 
j 24s? 



0<p„(logn) 3 . (40) 



P±{Z l+ i) 
n — i — 1 



M4 - 3sf 



15(4) 2 



0<p„ ((logn) 



8a?(n-*-l) 72sf(n-i-l) ( n 



If 



(41) 



The fifth term in the expansion plays no role in the asymptotics, under (A). 



To sum up , under (A), and comparing the remainder terms in (40 1 and (41 ), 
we get 



— (Z l+1 ) = n -Y i+1 / s.Vn - i - 1 .A.B + 0~ 



1 

(n — i — 1) 



3/2 



where A and B are given in (15 1 and (16). 



C Final step of the proof of Theorem 1 

We make use of the following version of the law of large numbers for triangular 
arrays (see [14] Theorem 3.1.3). 

Theorem 15 Let X^ n ,1 < i < k denote an array of row-wise real exchange- 
able r.v's and lim^-xx, k = oo. Let p n := EX\^ n Xi^ n . Assume that for some 
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finite r , EX\ n < T. If for some doubly indexed sequence (cti, n ) such that 
limn^oo J2i=i a i,n = '* holds 



then 

in probability. 
Denote 



lim p n V aj = 



lim ^ dLnXi n = 



K n := 



_M_3 

2s: 



4 ! K 2 

* _ i a 
t 1 ! '■— K l + ^2 



,4-4 , i5(^) 5 



8s? 72s« 



^2 :— K l 



2s? 



By (|13|, (|14J) and fl7|) 
with 



Vn — i m < n (s lV /«-«-i 



1 



%/n — i — 1 

jJ^i + l _ A'2 a «2 I ( £ " lo g ") 

n — i— 1 n—i—1 n—i—1 n—i — 1 



n(0) 



1 - + Cm 

n—i Vr. 



^(n-i) 3 / 2 ^ 

We perform a second order expansion in both the numerator and the denomi- 
nator of the above expression, which yields 



A{i) = exp 



-i-l 2sf(n-i-l) 



exp 



n — i — 1 



exp 



The term exp 



\ n-i-1 ^ 2sf (n-i-1) 



Qq3 n (en log 
(42) 



A' 



in (42 1 is captured in gi(Yi + i\ Yf). 



The term A'{i) in (42) writes 



with 



Q\ ■= exp - 



(n—i—l)(n—i) 



(4) 2 i 1 ( Ml^.+ l 
2{n-i) 2 T 2 1 n-i-1 



n—i—1 n—i — 1 
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and 

i exp Bi 



expB 2 

where 



(n — % — \y (n — % — \y 

V*2 a I , x. °V„( e n ( lo g^)) 2 , / 2x 

r o<p„(e„logn) + ; ho^J 



(n-i-l) 2 b ; (n-i-l) 2 



with 

u _ Mpj+i M|a Kg , Qq3„(£n log") 

n — i — 1 n — i — 1 n — z — 1 n — j — 1 

We first prove that 

fc-i 



as n tends to infinity. 
Since 



fe-i fc-i fc-i 



P (x{ = F/i^+i = ™n) = n ^ {Y i+ i\ yd n a '^) n l * 



i=0 i=0 i=0 

where 



„ , . = exp , 

$ (ii) v 7 " - i - 1 V n-i-l 

the completion of the proof will follow from 

fe-l 



! [ I 1 • <*p n (e„ (logn) 2 ). 



i=0 



The proof of (43 1 is achieved in two steps. 
Claim 16 nto Q\ = 1 + °^( e » (log™) 2 )- 
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By Lemma 13 the random terms deriving from n mi satisfy 

maxx<i< k \n) - nj \ = oqj n (l) 

as n tends to oo, where /ij is the j-th centered moment of p. Therefore we may 
substitute /A by Hj in order to check the convergence of all subsequent series. 
Developing Ql, define, for any positive j3i, /3%, /3q and fii 



A.,., 



i 



e„ (log n)' 



fc-i 

E 



fc-i 



1 , s 

e n (logn) i=Q 



and 



A„ : — 



1 2 e 

e„ (log n) i=0 



k-l 

E 



(n — i — l)(n — i) 

(M2«) 2 



(41 2 

(n - i - l) 2 



(n - i - l) 5 



(n- i- l) 2 



L e„(logn) . =0 
It clearly holds that 

lim % l (Ai) = l; j 1 1 

Let for any positive /3 5 



</3 3 



</?4 



fe-1 



1 2 e 

e n (logn) i=Q 



„i „i \r 



</?5 



(n-i- l) 2 

If limjj^oo <p n (A^) = 1, then lim,^^ (A^) , j = 6, 7 where 



fc-i 



A„ \ — 



A„ : — 



1 2 e 

e n (log n) i=0 



fe-i 



1 2 e 

e n (logn) ^ 



(n - i - l) 2 



Ml>2 a ^+1 



(n-i - l) 2 



< A, 



</?7 



Apply Theorem 15 with Xi <n = l^+i and a. 



^logn) (n— z— l) 2 



. By Lemma 



E<$ n Y? = s 2 (0)+a + O 



Hence E<p n [Y 2 ] < L for some finite L. Further p n = a 2 + (-) . Both conditions 
m Theorem [15] are fullfilled. Indeed 
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lim a 2 n i = lim = 

which holds under (El), as holds 

( k \ 2 a 2 
lim p n V a„ ;i = lim — — = 0. 

V^Tl / n ^°°e2(logn) (n-k) 2 
Therefore, for i between 5 and 7, we have 

lim <p„ = 1. 

n— foo 

Define for any positive (3$ 

k ~ 1 ^„t\ 2 v2 



A„ : — 



Apply Theorem 15 with X ijV , = Y? +1 and a z . n = e?i( i ogK) 2 1 ( „_ I _ 1)2 
It holds 



lim V a 2 nl = 



n— ^oo 

»=1 



when (El) holds. 
By Lemma [12] 

Ey n Y* = E„aY* + o' ' 



4 



which entails that for some positive constant Y such that EY* < T < 00. Also 
E Vn (Y 2 Y 2 ) = (.^(0) + a) ( S » + a)+0 



= 



and 

™ (mIos^ § (n-'-l) 2 ) 
which holds under (El). Hence 

lim <£„ (A 8 n ) = 1. 

n— ^oo 

It follows that, noting A„ the intersection of the events A l n , j = 1, ...,8 

lim qj„ (A„) = 1. 
To sum up, we have proved that, under (El), 

Ql = 1 + o<p„ (e„ (logn) 2 
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Claim 17 n to Qh = 1 



{e n (logn)' 



This amounts to prove that the sum of the terms in Bl (resp in B2) is of 
order o<p„ (e n (logn) 2 ) . 

The four terms in the the sum of the terms in Bl are respectively of order 
oq?™ ( e n( lo g n ) 4 ) /(n>- k )-, 0< 9 n (en(logn) 3 ) /(n - k), c*p n (ae„(logn) 2 ) /(n - k) 
and o<p n (e„(logn) 2 ) /(n — k) using Lemma 13 The sum of the terms o (it 2 ) is of 

order less than those ones. Assuming (El) all those terms are o<p n ^e„ (logn) 2 ) . 

For the sum of terms B2, by uniformity of the Edgeworth expansion with 
respect to Yf it holds £)* =1 m = °<Pn 
by (El). 



(n — k) x ^ 2 ) which is o«p n (logn)' 



We now turn to the proof of ( 44 1 
Define 

*4 



{x-af 



2s\ (n-i-1) 2s? (n-i-1) 



Use the classical bounds 
1-u 



u 
~2 



u 



< e~ u < 1 



u 



to obtain on both sides of the above inequalities the second order approximation 
of C^ 1 through integration with respect to p. The upper bound yields 



CI 1 < $(*,) + K{ + 2 . -. ($" (U) - 2a$' (t<) + a 2 ) 

n — I — 1 s; i! — i — 1 



f(n-i-l) 

/ 1 \ 



n-i-1) 



from which 

r . Vn 
Li < = exp 



- i { aK i \ I 1 + n-i-i 111 * 

2 i ™2 



/ : T" " I „ „' ill s ; +m;-2omi+o 2 , n f 1 \ 

^ n - 1 - 1 V n - » - 1 ; ^ - - ■ 2s J (n _i-i) + Oy„ ( (— ^ j 

where the approximation term is uniform on the Yf, 

Subsituting ^"7\ and ex P (~ n-^-i ) their expansion 1 + 2 ( n -i-i) 

{ (n-l-lf ) and 1 - ^tl + (n-i-1)' + ° ( (n-«ll)» ) in tlie U PP CT b ° Und ° f 

Li above yields 

r < A , 1 «ci , Kl) 2 , _ f 1 



2{n-i-\) n-i-1 2(n-i-l) 2 \(n - i - 1 

n\rrii sf + m,f — 2arrii + a 2 / 1 

1 H : r ~ ~ ; — \~ Otp ra 



n — i — 1 2s 2 (n — i — 1) \{n — i — 1 
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Using Lemma 13 mf — 2am.i + a 2 = oqj re (ae„) and therefore 



/ 1 an\ (a^l) 2 

1 " ^ + 2{n - i - 1) ~ n - i - 1 + {n - i - l) 2 + ° \ {n - i - If 

l n\a 1 Qq3„(ae«) 

n — i — 1 2(n — i — 1) n — z — 1 

Write 

k k 

n^<n( i+M *) 

i=l i=l 

with 

1 (n — i — l) 2 tz — z — 1 

Under (A) and (El), J2iZo * s 0< $ n ( e n (l°g n ) 2 ) • This closes the proof of 
the Theorem. 
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